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Abstract. We present the fastest FPRAS for counting and randomly 
generating simple graphs with a given degree sequence in a certain range. 
For degree sequence with maximum degree d m ax = 0(m 1 ^ 4 ~ T ), 

our algorithm generates almost uniform random graph with that degree 
sequence in time 0{md m &x) where m — 5 £\ di is the number of edges 
in the graph and r is any positive constant. The fastest known FPRAS 
for this problem [26] has running time of 0(m 3 n 2 ). Our method also 
gives an independent proof of McKay's estimate [37] for the number of 
such graphs. 

Our approach is based on sequential importance sampling (SIS) tech- 
nique that has been recently successful for counting graphs [18, 13, 12]. 
Unfortunately validity of the SIS method is only known through simu- 
lations and our work together with [12] are the first results that analyze 
the performance of these methods. 

Moreover, we show that for d — 0(n 1//2 ~ r ), our algorithm can generate 
an asymptotically uniform d-regular graph. Our results are improving 
the previous bound of d = 0(n 1//3_T ) due to Kim and Vu [34] for regular 
graphs. 



1 Introduction 

The focus of this paper is on generating random simple graphs (graphs with 
no multiple edge or self loop) with a given degree sequence. Random graph 
generation has been studied extensively as an interesting theoretical problem (see 
[47, 13] for detailed surveys). It has also become an important tool in a variety of 
real world applications including detecting motifs in biological networks [41] and 
simulating networking protocols on the Internet topology [46,23,36,17,4]. The 
most general and well studied approach for this problem is the Markov chain 
Monte Carlo (MCMC) method [26-28,19,21,30,10]. However, current MCMC- 
based algorithms have large running times, which make them unusable for real- 
world networks that have tens of thousands of nodes (for example, see [24]). This 
has constrained practitioners to use simple heuristics that are non-rigorous and 
have often led to wrong conclusions [40, 41]. Our main contribution in this paper 
is to provide a much faster fully polynomial randomized approximation scheme 



(FPRAS) for generating random graphs; this we can do in almost linear time. 
An FPRAS provides an arbitrary close approximaiton in time that depends only 
polynomially on the input size and the desired error. (For precise definitions of 
this, see Section 2.) 

Recently, sequential importance sampling (SIS) has been suggested as a more 
suitable approach for designing fast algorithms for this and other similar prob- 
lems [18,13,35,6]. Chen et al. [18] used the SIS method to generate bipartite 
graphs with a given degree sequence. Later Blitzstein and Diaconis [13] used 
a similar approach to generate general graphs. Almost all existing work on SIS 
method are justified only through simulations and for some special cases counter 
examples have been proposed [11]. However the simplicity of these algorithms 
and their great performance in several instances, suggest further study of the 
SIS method is necessary. 

Our Result. Let d\ , . . . , d n be non-negative integers given for the degree sequence 
and let Ym=i d% = 2m. Our algorithm is as follows: start with an empty graph 
and sequentially add edges between pairs of non-adjacent vertices. In every step 
of the procedure, the probability that an edge is added between two distinct 
vertices i and j is proportional to didj(l — didj/Am) where di and dj denote 
the remaining degrees of vertices i and j. We will show that our algorithm 
produces an asymptotically uniform sample with running time of 0(md max ) 
when maximum degree is of 0(m 1 ^ 4 ~ T ) and r is any positive constant. Then we 
use a simple SIS method to obtain an FPRAS for any ep, S > with running time 
0(md max e -2 log(l/<5)) for generating graphs with d max = 0(m 1 / 4 ~ T ). Moreover, 
we show that for d = 0{n 1 / 2 ~ T ), our algorithm can generate an asymptotically 
uniform d-regular graph. Our results are improving the bounds of Kim and Vu 
[34] and Steger and Wormald [45] for regular graphs. 

Related Work. McKay and Wormald [37, 39] give asymptotic estimates for num- 
ber of graphs within the range d max = 0(m 1 / 3 ~ T ). But, the error terms in their 
estimates are larger than what is needed to apply Jerrum, Valiant and Vazirani's 
[25] reduction to achieve asymptotic sampling. Jerrum and Sinclair [26] however, 
use a random walk on the self-reducibility tree and give an FPRAS for sampling 
graphs with maximum degree of o(m 1 ^ 4 ). The running time of their algorithm is 
0(m 3 n 2 e~ 2 log(l/<f)) [44]. A different random walk studied by [27,28,10] gives 
an FPRAS for random generation for all degree sequences for bipartite graphs 
and almost all degree sequences for general graphs. However the running time 
of these algorithms is at least 0(n 4 ra 3 <i max log 5 (n 2 /e)e -2 log(l/<5)). 

For the weaker problem of generating asymptotically uniform samples (not 
an FPRAS) the best algorithm was given by McKay and Wormald's switch- 
ing technique on configuration model [38]. Their algorithm works for graphs 
with c?^ ax —0(m 2 /^2 i d 2 ) and c^ ax = o(m + '^ li d 2 ) with average running 
time of 0(m + (J2i^i) 2 )- This leads to 0(n 2 d 4 ) average running time for d- 
regular graphs with d — o(n 1 ^ 3 ). Very recently and independently from our 
work, Blanchet [12] have used McKay's estimate and SIS technique to obtain 
an FPRAS with running time 0(m 2 ) for sampling bipartite graphs with given 



degrees when d max = o(m 1 / 4 ). His work is based on denning an appropriate 
Lyapunov function as well as using Mckay's estimates. 

Our Technical Contribution. Our algorithm and its analysis are based on beau- 
tiful works of Steger and Wormald [45] and Kim and Vu [33]. The technical 
contributions of our work beyond their analysis are the followings: 

1. In both [45,33] the output distribution of proposed algorithms are asymp- 
totically uniform. Here we use SIS technique to obtain an FPRAS. 

2. Both [45, 33] use McKay's estimate [37] in their analysis. In this paper we give 
a combinatorial argument to show the failure probability of the algorithm is 
small and attain a new proof for McKay's estimate. 

3. We exploit combinatorial structure and a simple martingale tail inequality 
to show concentration results in the range d = 0(n 1 / 2 ~ T ) for regular graphs 
that previous polynomial inequalities [32] do not work. 

Other Applications and Extensions. Our algorithm and its analysis provide more 
insight into the modern random graph models such as the configuration model or 
random graphs with a given expected degree sequence [20]. In these models, the 
probability of having an edge between vertices i and j of the graph is proportional 
to didj. However, one can use our analysis or McKay's formula [37] to see that in 
a random simple graph this probability is proportional to did — didj /2m). We 
expect that, by adding the correction term and using the concentration result of 
this paper, it is possible to achieve a better approximation for which sandwiching 
theorems similar to [34] can be applied. 

We have also used similar ideas to generate random graphs with large girth 
[7] . These graphs are used to design high performance Low-Density Parity-Check 
(LDPC) codes. One of the methods for constructing these codes is to generate a 
random bipartite graph with a given optimized degree sequence [5] . Two common 
heuristics for finding good codes are as follows: (i) generate many random copies 
and pick the one with the highest girth; (ii) grow progressively the graph while 
avoiding short loops. While the former implies an exponential slowing down 
in the target girth, the latter induces a systematic and uncontrolled bias in the 
graph distribution. Using the same principles, we can remove the bias and achieve 
a more efficient algorithm that sequentially adds the edges avoiding small cycles. 

Organization of the Paper. The rest of the paper has the following structure. The 
algorithm and the main results are stated in Section 2. In Section 3 we explain 
the intuition behind the weighted configuration model and our algorithm. It also 
includes the FPRAS using SIS approach. Section 4 is dedicated to analysis and 
proofs. The rest of the paper is dedicated to supplementary proofs. 

2 Our Algorithm 

Suppose we are given a sequence of n nonnegative integers d\ , di , . . . d n with 
Ym=i di = 2m and a set of vertices V = {v\, V2, ■ • ■ , v n }. Assume the sequence of 



given degrees d\ , . . . , d n is graphical. That is there exists at least one simple graph 
with those degrees. We propose the following procedure for sampling (counting) 
an element (number of elements) of set C(d) of all labeled simple graphs G 
with V(G) = V and degree sequence d=(d\,d2, • • • , d n ). Throughout this paper 
m = Y^i=i 1S number of edges in the graph, d max = max™ =1 {di} and for 
regular graphs d refers to degrees; i.e. di = d for alH = 1, . . . , n. 



Procedure A 

(1) Let E be a set of edges, d = (di, . . . , d n ) be an n-tuple of integers and P be 
a number. Initialize them by E — 0, d — d, and P = 1. 

(2) Choose two vertices Vi, Vj € V with probability proportion to didj(l — f^) 
among all pairs i,j with i ^ j and (vi,Vj) £ E. Denote this probability by 
Pij and multiply P by pij . Add (vi , Vj ) to E and reduce each of di , dj by 1 . 

(3) Repeat step (2) until no more edge can be added to E. 

(4) If \E\ < m report failure and output N = 0, otherwise output G — (V,E) 
and N = (to! P)" 1 . 



Note that for regular graphs the factors 1 — didj/Am are redundant and Pro- 
cedure A is the same as Steger-Wormald's [45] algorithm. Next two theorems 
characterize output distribution of Procedure A. 

Theorem 1. Let r > 0: For any degree sequence d with maximum degree of 
0(m}l A ~ T ), Procedure A terminates successfully in one round with probability 
1 — o(l) and any graph G with degree sequence d is generated with probability 
within 1 ± o(l) factor of uniform. Expected running time of Procedure A is at 
most 0(md max ). 

For regular graphs a similar result can be shown in a larger range of degrees. 
Let £(n 7 d) denotes the set of all d regular graphs with n vertices: 

Theorem 2. Let t > 0: If d = 0(n 1 / 2 " T ) ; then for all graphs G G C(n,d) 
except a set of size o(\C(n,d)\) Procedure A generates G with probability within 
1 ± o(l) factor of uniform. In other words as n — > oo output distribution of 
Procedure A converges to uniform in total variation metric. 

The above results show that output distribution of Procedure A is asymptotically 
uniform only as n — > oo. But for finite values of n we use SIS to obtain an FPRAS 
for calculating \£(d)\ and randomly generating its elements. 

Definition 1. An algorithm for approximately counting (randomly generating) 
graphs with degree sequence d is called an FPRAS if for any e, S > 0, it runs 
in time polynomial in to, 1/e, log(l/5) and with probability at least 1 — 5 the 
output of the algorithm is number of graphs (a graph with uniform probability) 
up to a multiplicative error of 1 ± e. For convenience we define a real valued 
random variable X to be an (e, 5)-estimate for a number y if P{(1 — e)y < X < 
(1 + e)y} > 1 - 5. 

The following theorem summarizes our main result. 



Theorem 3. Let t > 0: For any degree sequence d with maximum degree of 
0(m 1 / 4 ~ T ) and any e,5 > we give an FPRAS for counting and generating 
graphs with degree sequence d with running time 0(m d max e -2 log(l/<5)). 

3 Definitions and the Main Idea 

Before explaining our approach let us quickly review the configuration model. 
Let W = U™ =1 W, be a set of 2m = Ym=i ^» labeled mini- vertices with |Wj| = di. 
If one picks two distinct mini-vertices uniformly at random and pairs them to- 
gether then repeats that for remaining mini-vertices, after m steps a perfect 
matching M. on the vertices of W is generated. Such matching is also called a 
configuration on W. One can see that number of all distinct configurations is 
equal to (1/m!) JI^Lq 1 (^'T^) - same equation shows that this sequential 
procedure generates all configurations with equal probability. Given a configu- 
ration M, combining all mini- vertices of each Wi to form a vertex Vi, a graph 
Gm is generated whose degree sequence is d. 

Note that the graph Gm might have self edge loops or multiple edges. In 
fact McKay and Wormald's estimate [39] show that this happens with very high 
probability except when d max = Ofiog 1 ^ 2 m). In order to fix this problem, at 
any step one [45] can only look at those pairs of mini- vertices that lead to simple 
graphs (denote these by suitable pairs) and pick one uniformly at random. For 
(i-regular graphs when d = 0(n 1 / 28 ~ T ) Steger-Wormald [45] have shown this 
approach asymptotically samples regular graphs with uniform distribution and 
Kim-Vu [33] have extended that result to d = 0(n 1 / 3 - T ). 

Weighted Configuration Model. Unfortunately for general degree sequences some 
graphs can have probabilities that are far from uniform. In this paper we will 
show that for non-regular degree sequences suitable pairs should be picked non- 
uniformly. In fact Procedure A is a weighted configuration model where at any 
step a suitable pair (u, v) e Wi x Wj is picked with probability proportional to 
1 — didj/4m. 

Here is a rough intuition behind Procedure A. Define the execution tree T of 
the configuration model as follows. Consider a rooted tree where root (the vertex 
in level zero) corresponds to the empty matching in the beginning of the model 
and level r vertices correspond to all partial matchings that can be constructed 
after r steps. There is an edge in T between a partial matching M r from level 
r to a partial matching A4 r +i from level r + 1 if M. r C M. r +i- Any path from 
the root to a leaf of T corresponds to one possible way of generating a random 
configuration. 

Let us denote those partial matchings M. r whose corresponding partial graph 
GM r is simple by "valid" matchings. Denote the number of invalid children of 
A4 r by A(A4 r ). Our goal is to sample valid leaves of the tree T uniformly at 
random. Steger- Wormald's improvement to configuration model is to restrict the 
algorithm at step r to the valid children of A4 r and picking one uniformly at 
random. This approach leads to almost uniform generation for regular graphs 



[45, 33] since the number of valid children for all partial matchings at level r of 
the T, is almost equal. However it is crucial to note that for non-regular degree 
sequences if the (r + l) th edge matches two elements belonging to vertices with 
larger degrees, the number of valid children for A4 r +i will be smaller. Thus there 
will be a bias towards graphs that have more of such edges. 

In order to find a rough estimate of the bias fix a graph G with degree 
sequence d. Let M(G) be set of all leaves M. of the tree T that lead to graph G; 
i.e. Configurations M. with Gm = G. It is easy to see that |M(G)| = to! n"=i d%\. 
Moreover for exactly (1— q r ) |M(G)| of these leaves a fixed edge (i, j) of G appears 
in the first r edges of the path leading to them; i.e. € M r - Here q r = (m — 
r)/m. Furthermore we can show that for a typical such leaf after step r, number 
of unmatched mini- vertices in each Wj is roughly diq r . Thus expected number 
of un-suitable pairs (u,v) is about X^ G j didjq^(l — q r ). Similarly expected 
number of unsuitable pairs corresponding to self edge loops is approximately 
E?=i (*2 P ) » 2m^A(J) where X(d) = £^ =1 (t)/(£2=i Therefore defining 
7g = J2i~ G j didj/4m and using ( 2m ^" 2r ) ~ 2m 2 qf we can write 

n m— 1 ^ 

P(G) « to! [] di! J] 2m2g 2 _ 2mg 2A(d) - 4to(1 - g r )g r 2 7G 

n m — 1 ^ 
i=l r=0 v 2 y 

Hence adding the edge roughly creates an exp(didj / 4m) bias. To cancel 

that effect we need to reduce probability of picking (i,j) by exp(— didj/4m) w 
1 — did j /4m. We will rigorously prove the above argument in Section 4. 



Sequential Importance Sampling (SIS). For finite values of m output distribution 
of Procedure A can be slightly nonuniform but sequential nature of Procedure 
A gives us extra information that can be exploited to obtain very close approxi- 
mations of any target distribution, including uniform. In particular the random 
variable N that is output of Procedure A is an unbiased estimator for 
More specifically: 

E A (7V) = E A (A1 {A 

succeeds} ) 

GeC(d) kg 

where ttq denotes one of the to! ways of generating a graph G by Procedure A. 
Therefore we suggest the following algorithm for approximating 



Algorithm: CountGraphs(e, 5) 

(1) For k = k(e, d) run Procedure A exactly k times and denote the correspond- 
ing values of the random variable N by N\, . . . , Nk- 

(2) Output X = Wl+ - fc +jVfc as estimator for \C(d)\. 



We will show in the next section that if variance of the random variable N 
is small enough then Algorithm CountGraphs gives the desired approximation 
when k(e, S) = 0(e~ 2 log(l/<5)). In order to generate elements of £(d) with prob- 
abilities very close to uniform we will use the estimator X and apply SIS to a 
different random variable. For that the algorithm is as follows: 



Algorithm: GenerateGraph(e, S) 

(1) Let X be an (e, <5)-estimate for \£(d)\ given by Algorithm CountMatch. 

(2) Repeat Procedure A to obtain one successful outcome G. 

(3) Find an (e, <5)-estimate, Pg, for Pa(G) using Procedure B. 

(4) Report G with probability 3 ^g- and stop. Otherwise go back to step (2). 



The crucial step of Algorithm GenerateGraph is step (3) that is finding (e, 8)- 
cstimatc for Pa(G). For that we use Procedure B which is exactly similar to 
Procedure A except for steps (2), (4). 



Procedure B(G, e, 5) 

(1) Let E be a set of edges, d = (di, . . . , d n ) be an n-tuple of integers and P be 
a number. Initialize them by E — 0, d = d, and P = 1. 

(2) Choose an edge e = (vi,Vj) of G that is not in E uniformly at random. 
Assume that (vi, vj) is chosen with probability proportion to didj(l — ^jf-) 
among all pairs i, j with i ^ j and (vi,Vj) ^ E. Denote this probability by 
Pij and multiply P by pij. Add (vi, Vj) to E and reduce each of d%, dj by 1. 

(3) Repeat step (2) until no more edge can be added to E. 

(4) Repeat steps (1) to (3) exactly I = £(e, 6) times and let Pi, . . . , Pi be corre- 
sponding values for P. Output Pq = ml Pl+ '^ +Pe as estimator for Pa(G). 



Note that random variable P at the end of step (3) is exactly Pa(7Tg) for a 
random permutation of the edges of G. It is easy to see that Eb(P) — Pa(G)/to!. 
Therefore Pg is an unbiased estimator for Pa(G). Later we will show that it is 
an (e, <5)-estimator as well by controlling the variance of random variable P. 

4 Analysis 

Let us fix a simple graph G with degree sequence d. Denote the set of all edge- 
wise different perfect matchings on mini-vertices of W that lead to graph G by 
R(G). Any two elements of R(G) can be obtained from one another by permuting 
labels of the mini- vertices in any Wi. We will find probability of generating a 
fixed element M in R(G). There are ml different orders for picking edges of M 

3 In section 4 we will show that < 5Pg for large enough n (independent of e, 8 > 0). 



sequentially and different orderings could have different probabilities. Denote 
the set of these orderings by S(M ). Thus 



MeR(G)Af£S(M) i=l AfeS(M) 

For any ordering N = {e\, . . . , e m } G S(M) and < r < m— 1 denote probability 



of picking the edge e r+ i by P(e r+ i). Hence Pa (A/ - ) = Il^Lo 1 IPC e r-+i ) an d each 
equals to 

F(e r+1 = (i,j)) 



term P(e r+ i) equals to 

(1 - didj/Am) 



J2(u,v)eE r d^d^il - d u d v /4m) 

(r) 

where d\ denotes the residual degree of vertex i after r steps which shows num- 
ber of unmatched mini- vertices of Wj at step r. The set E r consists of all possible 
edges after picking ei, . . . , e r . Denominator of the above fraction for P(e r+ i) can 
be written as ( 2m ~ 2r )->^(jV r ) where 3^) = ^(M-)+E(i, j)e B r d[ r) d ( - ] d t dj I ^m. 
This is because 22 (u v)eE r du d v counts number of suitable pairs in step r and 
is equal to ( 2m ^ 2r ) — A(Af r ). The quantity ^(Afr) can be also viewed as sum of 
the weights of unsuitable pairs. Now using 1 — x = e - x +°( x2 ) for < x < 1 we 
can write Pa(G) in the range d max = 0(m 1 / 4: ~ T ) as following 

n j i m— 1 

PA(G)=n*«n( 1 -^) e np^j 

i=l l ~GJ AAGS(A4) r=0 I 2 J ^-^j 

n m— 1 

i=l AAeS(A4) r=0 \ 2 J ^^V r j 

The next step is to show that with very high probability ^(Af r ) is close to a 
number Vv(G) independent of the ordering TV. More specifically for ip r (G) = 

(2- - 2,) 2 + + the following is true 

m— 1 ^ m — 1 ^ 

The proof of this concentration uses Kim-Vu's polynomial [32] and is quite tech- 
nical. It generalizes Kim-Vu's [33] calculations to general degree sequences. To 
the interest of space we omit this cumbersome analysis but it is given in in Section 
5 of extended version of this paper [8]. But in Section 4.1 we show concentration 
for regular graphs in a larger region using a new technique. 

Next step is to show the following equation for d max = 0(m 1 / 4 ~ T ). 

m— 1 m— 1 -. 

TT t = TT 1 A(d)+A 2 (d)+ 7G +o(l) (o) 



Proof of equation (2) is algebraic and is given in Section 5.2 of [8]. Following 
lemma summarizes the above analysis. 



Lemma 1. For c? max = 0{m 1 / i T ) Procedure A generates all graphs with degree 
sequence d with asymptotically equal probability. More specifically 

V P(Af) - m! \(d)+\ 2 (d)+o(i) 

M&(M) llr=(U 2 I 

Proof ( of Theorem order to show that Pa (G) is uniform we need an impor- 

tant piece. Lemma 1 shows that Pa(G) is independent of G but this probability 
might be still far from uniform. In other words we need to show that Procedure 
A always succeed with probability 1 — o(l). We will show this in section 6 of [8] 
by proving the following lemma. 

Lemma 2. For d max = Oim v l^ r ) the probability of failure in one trial of Pro- 
cedure A is o(l). 

Therefore all graphs G are generated asymptotically with uniform probability. 
Note that this will also give an independent proof of McKay's formula [37] for 
number of graphs. 

Finally we are left with analysis of the running time which is given by the 
following lemma. 

Lemma 3. The Procedure A can be implemented so that the expected running 
time is 0(md max ) for d max = 0(m 1/4 ~ T ). 

Proof of Lemma 3 is given in Section 7 of [8] . This completes proof of Theorem 
1. ■ 



Proof (of Theorem 3). First we will prove the following result about output X 
of Algorithm GraphCount. 

Lemma 4. For any e, S > there exist k(e, d) = 0(e~ 2 log(l/(5)) where X is an 
(e, <5) -estimator for \£(d)\. 

Proof. By definition 

p«i - c) w)\ < x < (i + e) |£(j)i) = p {- J^EL < ^MB < J^L] 

^ ' ^ / Var A (jV) / Var A (AQ / Var A (AQ J 

On the other hand by central limit theorem lim^oo ^~ e aW = z ^ N(0, 1). 

yfV&r A (N)/k 

Therefore eE A (jV) guarantees that X is (e, <5)-estimator |£(<i)| where 

A /Var A (AT)/fc b y ' ' 

¥(\Z\ > z s ) = 8. This condition is equivalent to: k > z 2 e- 2 Va,r A (N)/E A (N) 2 . 
Moreover for tail of normal distribution we have P(Z > x) ~ x~ x e~ x ^ 2 (2tt)^ 1 as 



x — > +00 which gives zf = o(log(l/<5)). This means if we show \&v^{N) /~Rx{N) 2 < 
oo then k = 0(log(l/<5)e~ 2 ). In fact we will prove a stronger statement 

Proof of equation (3) is given in Section 8.1 of [8]. 

Last step is to find running time of Algorithm CountGraph. By Theorem 1 
each trial of Procedure A takes time 0(m d max ). Therefore running time of Algo- 
rithm CountGraph is exactly k times 0(md max ) which is 0(md max e~ 2 log(l/<5)). 
This finishes proof of Lemma 4. ■ 

Now we are left with analysis of Algorithm GenerateGraph which is given by 
following lemma. 

Lemma 5. For any e, S > there exist £(e, 5) for Procedure B such that Al- 
gorithm GenerateGraph uses 0(md max e~ 2 \og(l/5)) operations and its output 
distribution is an (e, S) -estimator for uniform distribution. 

Similar calculations as in proof of Lemma 4 show that provided Vare (P) /Eb (P) 2 < 

00 then for any graph G there exist £(e, 5) = 0(e~ 2 log(l/5)) such that estimator 
Pq is an (e, (5)-estimator for Pa(G). Similarly we will show the following stronger 
result in section 8.2 of [8]. 

VarB ( P ) = (1) (4) 
E B (P) 2 { ' ( ' 

Now we are ready to prove Lemma 5. 

Proof (Proof of Lemma 5). From Theorem 1 there exist integer n > inde- 
pendent of e,5 such that for n > no the following is true 0.5|£(rf)| < Pa(G) < 
1.5|£(J)|. Thus using Lemma 4 for any e',6' > and k = k(e',5') and also 

1 = k(e', S'), with probability 1 — 25' we have: 

Note that for 5' < 5/2 and e' < min(0.25, e/2) the upper bound in equation 
(5) is strictly less than 5 which means hPcX > 1 and this validates step (4) 
of Algorithm GenerateGraph. Moreover the lower bound in equation (5) is at 
least 1/8 which shows that expected number of repetitions in Algorithm Gen- 
erateGraph is at most 8. On the other hand probability of generating a graph 
G by one trial of Algorithm GenerateGraph is equal to Fa(G)/(5XPg). This 
means probability of generating an arbitrary graph G in one round of Algorithm 
GenerateGraph is an (e, <5)-estimator for ^L. Therefore final output distribution 
of Algorithm GenerateGraph is an (e, <5)-estimator for uniform distribution. 

Expected running time of Algorithm GenerateGraph is at most 8fc times 
expected running time of Procedure A plus expected running time of Algorithm 
GraphCount which is 0(md max e^ 2 log(l/<5)). This finishes proof of Lemma 5 
and therefore proof of Theorem 3. I 



4.1 Concentration inequality for random regular graphs 



Recall that £(n, d) denotes the set of all simple d-regular graphs with m = 
nd/2 edges. Same as before let G be a fixed element of £(n, d) and M be 
a fixed matching on W with Gm — G. The main goal is to show that for 
d = o{n 1 / 2 ~ T ) probability of generating G is at least (1 — o(l)) of uniform; i.e. 
P A (G) = (d!) n Ej^eS(M) P (^0 > (l-o(l))/IA«.d)l- 0ur proof builds upon 
the steps in Kim and Vu [34] . The following summarizes the analysis of Kim and 
Vu [34] for d = 0(n 1 / 3 ~ T ). Let mi = where to goes to infinity very slowly; 
e.g. 0(log <5 n) for small 5 > then: 

/ n 1 m m_1 /2m-2r\ _ 

|r(n,d)|(d!)» E F(AA) ( ^i-ifl Eil 1 2 j 



AAeS(Ai) m! ATeS(M) r=0 ( ™2 ~ A W 

1-0(1) ^ jfj^l ' 



jVeS(.M)r=o 

mi 



( 2 V)-^W-) 



r=0 x 

(<z) / " ll T 1 + T 2 \ 

s < i -°< i »-'(- 8e S^#j (6) 

Here we explain these steps in more details. First define \i r = fa + fi 2 where fa = 
(2m-2r) 2 (d-l)/4m and fij = (2m - 2r) 2 (d - l) 2 r/4m 2 . Step (a) follows from 
equation (3.5) of [34] and McKay- Wormald's estimate [39] for regular graphs. 
Also algebraic calculations in page 10 of [34] justify (b). 

The main step is (c) which uses large deviations. For simplicity write A r 
instead of A{M r ) and let A r = A\ + A 2 , where A\ and A 2 , show the number 
of unsuitable pairs in step r corresponding to self edge loops and double edges 
respectively. For p r — r/m, q r = 1 — p r Kim and Vu [34] used their polynomial 
inequality [32] to derive bounds T*, TJ and to show with very high probability 
\Al — fa*\ < Tr and \ A 2 — [i 2 \ <T 2 . More precisely for some constants c\, Ci 

T} = ci log 2 n v / nd 2 q 2 (2dq r + l) , T 2 = c 2 log 3 n V / nd^q 2 {d 2 q r + 1) 

Now it can be shown that fa = o ((2m - 2r) 2 ) and T* = o ((2m - 2r) 2 ) , i = 1, 2 
which validates the step (c). The step (d) is straightforward using 1 — x > e~ ex 
for 1 > x > 0. 

Kim and Vu show that for d = 0(n 1 / 3 ~ T ) the exponent in equation (6) is of 
o(l). Using similar calculations as equation (3.13) in [34] it can be shown that 
in the larger region d = 0{n 1 l 2 ~ T ) for m 2 = (mlog 3 n)/d: 

mi rpi mi rp 2 

^ (2m -2r) 2 = ' 2 (2m~2r) 2 = 

r— r—m-2 ■ ' 



But unfortunately the remaining YlT=o (2ro-^2r) 2 ^ s °^ 0T( ^ CT f?(d 3 /n). In fact it 
turns out the random variable A 2 has large variance in this range. 

Let us explain the main difficulty for moving from d = 0(n 1 ^^ T ) to d = 
0(n 1 ^ 2 ^ T ). Note that A 2 is defined on a random subgraphs Gtf r of the graph G 
which has exactly r edges. Both [45] and [33, 34] have approximated the GV r with 
G Pr in which each edge of G appears independently with probability p r = r/m. 
Our analysis shows that when d = 0(n 1 / 2 ~ T ) this approximation causes the 
variance of A 2 , to blow up. 

In order to fix this problem we modify A 2 , before moving to G Pr . It can be 
shown via simple algebraic calculations that: A 2 — fi 2 = X r + Y r where X r = 

S«~ Gjv vi- d u ] -qr(d-l)][d { v r) -q r (d-l)] and Y r = q r (d-l) J2 U (d^ - q r d) 2 - dp r q r 
This modification is critical since the equality A 2 — fi 2 = X r — Y r docs not hold 
in G Pr . 

Next task is to find a new bound T 2 such that \X r — Y r \ < T 2 with very 

high probability and J2T=o (2m-2r) 2 = ^ ^ s eas y *° see that in G Pr both 

X r and Y r have zero expected value. At this time we will move to G Pr and show 
that X r and Y r are concentrated around zero. In the following we will show the 
concentration of X r in details. For Y r it can be done in exact same way. 

Consider the edge exposure martingale (page 94 of [3] ) for the edges that are 
picked up to step r. That is for any < I < r define Z\ — E(X r \ e\, . . . , eg). 

(r) 

For simplicity of notation let us drop the index r from Z r t ,d u ,p r and q r . 

Next step is to bound the martingale difference \Zi — Note that for 

e, = (u,v): 



\Zi—Zi. 



(d u -(d-l)q)(d v -(d-l)q)+ (d*-(d-l)q)+ (d v ,-(d-l)q) 



(7) 

Bounding the above difference should be done carefully since the standard worst 
case bounds are weak for our purpose. Note that the following observation is 
crucial. For a typical ordering Af the residual degree of the vertices are roughly 
dq ± \fdq~. We will make this more precise. For any vertex u € G consider the 
following L u = {\d u — dq\ < clog 1 / 2 n(dq) 1 / 2 } where c > is a large constant. 
Then the following lemma holds: 

Lemma 6. For all < r < m 2 the following is true: P(££) = o(Aj). 

Proof of Lemma 6 uses generalization of the Chernoff inequality from [3] and is 
given in Section 9 of [8]. 

To finish bounding the martingale difference we look at other terms in equa- 
tion (7). For any vertex u consider the event 

Ku=\\ ]T {d u ,-{d-l)q)\ <c({dqfl 2 + qd + dq 1 l 2 Yogn 

where c > is a large constant. We will use the following lemma to bound the 
martingale difference. 



Lemma 7. For all < r < ni2 the fallowings hold P(K^) = o(^). 

Proof of Lemma 7 is given in Section 9 of [8] . 

Now we are ready to bound the martingale difference. Let L — Hull {L U C\ 
K u ). Using Lemmas 6, 7 and the union bound P(L C ) = o(l/m 2 ). Hence for the 
martingale difference we have |Zj — < 0(dq + dq 1 ' 2 + (dq) 3 / 2 ) logn. 

Next we state and use the following variation of Azuma's inequality. 

Proposition 1 (Kim [31]). Consider a martingale {Yi}™_ adaptive to a fil- 
tration {#i}™ =0 . If for all k there are A^-i € Bfe-i such that E[e wYfe |Bfe_i] < Cu 
for all k = 1, 2, • • • , n with Cu > 1 for all k, then 

n 

P(Y - E[Y] > A) < e-^ l[C k + P(U^^ fc ) 

fe=i 

Proof (of Theorem 2). Applying the above proposition for a large enough con- 
stant c' > gives: 

p(\X r \ > c'^Srlog 3 n (dq + d(q) 1 / 2 + (dq) 3 / 2 )^ < e' 3lo ^ n + P(L C ) = o(-^) 

The same equation as above holds for Y r since the martingale difference for Y r 
is of 0(\dq r (d u - q r d)\) = 0((dq) 3 / 2 log 1/2 n)) using Lemma 6. 

Therefore defining f 2 = c'{dq + d(q) 1 / 2 + (dq) 3 / 2 ) yV log 3 n we only need to 
show the following is o(l). 

^ (dq + d(q) 1 / 2 + (dq) 3 / 2 )^6rlog 3 n 

But using ndq = 2m — 2r: 

m2 (dq + dq 1 / 2 + (dq) 3 ' 2 ) ^6r log 3 n 



n 2 d 2 q 2 

r=0 1 

~ 7^0 U 1/2 (2m - 2r) + (2m - 2r) 3 /2 + n ( 2m - 2r)V2 
^fd 1/2 \og(nd) d d \ , i = 

^ (^^ + (^o7^ + ^) log n 

which is o(l) for d = 0(n 1 / 2 - r ). ■ 

5 Generalizing Kim-Vu's Concentration Result and Proof 
of Equation (1) 

The aim of this section is to show equation (1) via generalization of Kim-Vu's 
analysis. Let us define: 

m— 1 /2m— 2r\ / //-«\ 

( j M r 2 - 2r ) - nK) 



then equation (1) is equivalent to 



E(/(A0) = l + o(l) (8) 

where expectation is with respect to uniform distribution on space of all ml 
orderings of M. that is denoted by S(M). Proof of equation (8) is done by 
partitioning the set S(Ai) into smaller pieces and looking at concentration of / 
on each of them. The partition is given in Section 5.3. But before that we need 
some extra notations. 



5.1 Definitions 

In section 4 we saw that probability of choosing an edge between Wi , Wj at 

step r + 1 of Procedure A is equal to (1 - [( 2m ^ 2r ) -^{K)]' 1 where 

W) = E {i , M E r 4 T) 4 ] + £ W ) e E r ^ r) 4 r %- For simplicity in notation 
from now <Zv means ^{M r ) unless it is specified. For our analysis we need to 
break \P r into more pieces. Let: 

A r = £ W 

^ 1 3 4m ^ 1 3 Am 

(id) t E r 

hence: 

& r = A r + A r 

For any edge (i, j) ^ E r there are two possibilities. It is either creating a self 
loop (i = j) or a double edge. We distinguish between these two cases by an 
extra index as follows: 

A-,i = ^ ( 9 ) se ^ ^ oops 

i=i ^ ' 
A r ,2 = A r — A r s double edges 

Moreover 

4m.l r = J2 d^d^didj - J2 dj r) 4 r) djdj 



i=l ^ ' $ E r , ift 

€ E r , i=tj 



We distinguish these last three terms by adding numerical indices to A r \ 

n n 



Hence 



i=1 i=1 (hi) i E r , i^j 



A = ^,1 + ^,2 A r , 3 



8m 4m 

The following simple bounds will be very useful throughout section 5. 
Lemma 8. For all r the fallowings hold: 

(i ) Ar < 

(ii) A rA < rf ma x(2m - 2r) 
Proof. : 

(i) At step r there are 2m — 2r mini- vertices left and for each each u € Wj 

there are at most <i ma x — 1 mini-vertices in Wi that u can connect to. Hence 
A ^ < (2m-2rKd max -i) _ Similarly u can conncct to at most (d - l) 2 mini- 

vertices from Wj with i ^ j to create double edge. Taking into account 

double counting A r2 < ( 2m ~ 2r )( 2 d """'~ 1 ) _ n ow use A r — A r i + A r 2 to get 

(ii) yl r> i < c? max X)„ dl r) = d max (2m - 2r) 

(iii) If follows from definition of A r that 

A r = V rf w (i wM<<E y rf w rf w<^f 2m - 2r 

^ ' J k " 4m ^ ! 3 " 4m \ 2 



In order to define ip r we look at a slightly similar model. Recall that Gj\f r is 
the partial graph that is constructed up to step r. With uniform distribution 
on S(A4) the graph Gj^ r is a random subgraph of G with exactly r edges. We 
can approximate this graph by G Pr which is the random subgraph of G which 
contains each of G independently with probability p r = r/m. Now we are ready 
evaluate quantities E Pr (A r ^),E Pr (A r ^2),Ep r (A r ^),E Pr (A rt2 ) and E Pr (/l r;3 ). 

Lemma 9. With uniform distribution on all m\ orderings of M: 

(i) E Pr (A rA ) = Qt^£. = 

(in) E Pr (A rA ) = (2m - 2r)^|^J 
(iv) E Pr (^ r , 2 ) = (2m-2r)2f^ 



(v) E Pr (A r%3 ) = i — i- ^ GJ 2m 3 J 

Proof, (i) Under uniform measure on S(.M) each of first r edges of A/" is chosen 
uniformly at random. So every edge has a probability of — to be chosen. 
Let Xi be number of unordered tuples {j, i, k} where {j, i}, {i, k} £ E(G) \ 
E{Gj^ r ). Then Xi is the number of unsuitable edges that connect two mini- 
vertices of Wi at (r + \) th step of creating Af. Hence 

n 
i=l 

On the other hand for a fixed i, number of tuples { j, i, k} where { j, i}, {i, k} £ 
E(G) is exactly (*) and probability of {j,i},{i,k} $ E(G// r ) is (1 - 
which gives: E(Xi) = (1 - ^) 2 (2*) and this combined with (9) completes 
proof of (i). 

(ii) Define Yy be number of unordered tuples {k,i, j, 1} where {i,j} £ E{Gj^ r ) 
but {k, i}, {j, 1} £ E(G) \ E(Gj^ r ). Then is exactly number of unsuitable 
edges between Wi,Wj at (r + l) th step of creating Af. Hence 

MK) = £ Y v ( 10 ) 



On the other hand for fixed a i j, number of tuples {k,i,j, 1} where 
{k,i},{j, 1} £ E(G) is exactly (d* — l)(dj — 1) and probability of {k, i}, {j, 1} ^ 
E(Gj^ r ) is (1 — and probability of {i, j} £ E(Gj^ r ) is ^ which gives: 
E(Yy) = ^(1 -^) 2 (^ - 1)(^ - 1) and this with (10) completes proof of 
("') 

),(iv) These two parts are direct consequence of E(dj r ') = (1 — ^)c£j. 
(v) Exactly similar to (ii) except we are using the following instead of (10) 



Definition 2. Exact definition of i[) r from Section 4 is given by: 

W-E Pr (A-) + ^ (11) 

Corollary 1. for a// r using equation (11) we obtain ip r =0( ^ d """ t ^™~ 2r '^ j. 



Now we are ready to prove equation (2) 



5.2 Algebraic Proof of Equation (2) 



ro-l (2m—1r\ 

TT [ 2 ' 

r U „(V)-i 

m— 1 . 

n d+ ti 

m-l M , rT,^ G] (d l -l)(d ] -l) (S , d 2) 

n/. m ' 2m 3 16m 3 

- ^ -0{ 



+ 0(- 



+ 



m(2m — 2r) 



r=0 



2m-2r 



= exp 
= cxp 
= cxp 

= cxp 
= cxp 
= exp 



m— i 

[£iog(i 



m(2m-2r) 



r=0 
m— 1 



1 - 



2m-2r 



L ^— ' V m 2m 3 16m 3 m 2 



+ 



d 2 



r=0 
m— 1 



m 2 m(2m — 2r) 



[EC 

[a(J) + 

A(d) + 



2m 3 



+ 



16m 3 



0(- 



+ 



m 2 m(2m — 2r) 



)) 



m(m - 1) Ei~ Gj K - iKjj - 1) (E^ 



2\2 



4 m 3 



E^ GJ K-i)K -i) , (Ei < 2 ) 



4m 



+ 



16m 2 
d 2 



0(^log(2m) + ^£) 
m m 



16m 2 



+ 0(_5»« + log(2m) + -2SS) 



4m 4m 4 16m z 



m m 
2 



m. 



(l + o(l)) exp A(d) + A 2 (d) + 



4m 



where (12) uses log(l + x) = x - 0(x 2 ) and (13) uses d max = 0(m 1 / 4 " r ). We 

/ d 2 

also used ^ 2m _ 2r ^ = 0(-^f L ) couple of times. ■ 



5.3 Partitioning the Set of Orderings S(Ad) 

Partitioning of S(M) will be done in four steps. At each step one or a family of 
subsets of S(M) will be removed from it. 

Step 1) Consider those orderings A/" € S(A4) where at any state during the algorithm 
number of unsuitable edges is a constant fraction of number of all available 
edges. More specifically for any arbitrary small < r < 1/3 let: 

S*(M) = IN e S{M) | ${K) < (1 - r/A)( 2m 2 2V | : V ' r <, m - I 



The first element of the partition will be the set S(M) \ S*(M). Now we 
need to partition S*(A4) further. 



Step 2) Consider those members of S*(M) for which &(Af r ) -9 (W r ) > (logn) 1+<5 for 
all < r < m — 1. Denote this set by A. Here <5 is a small positive constant. 
For example 5 < .1 works. 

Step 3) From S'*(A / () \ A remove those elements with \P(N r ) > for some r with 
2m — 2r < (log n) 1+2S . Put these elements in the set B. 

Step 4) Last element of the partition is C = S*(M) \ (A U B). 

Equation (1) is result of the fallowings: 



E(/(.A0U) = o(l) (15) 

E(/(AOls) = o(l) (16) 

E(/(^)l c )< l + o(l) (17) 

E(jW)lc) > l-o(l) (18) 

E(./W)1 S (.M)\ S .(M)) - 0(1) (19) 



Proof of (15) involves further partitioning of A into smaller sets and using Vu's 
inequality carefully. It is given in section 5.5. Proofs of (16-19) are easier and 
use typical combinatorial and algebraic bounds. They are also proved in section 
5.5. 



5.4 More notations 

In order to prove (15) we need some extra notations. Let w = (logn) 5 . Remember 
from section 5.3 that where S > is a very small constant. Let A = uologn and 
Aj = 2 l Ao for i = 1, 2, . . . , L. L is such that < cd 2 \ogn < Xl where c is a 

large constant specified later. For any < r < m — 1 : T r isa function defined 
as follows: 

Definition 3. Let q r = (1 — r/m), p r = 1 — q r VO < r < m — 1. Then let 



a r (A) = cyj X(md max q 2 + \ 2 )(dq r + A) 
PAX) = c VggggTAggVTAj 

7r(A) - Cy/\( mdl ax q3 + \ 3)(<Pq? + \*) 
Cr(A) = cd 2 q r y / X(mq r + A) 
v r = 8m< ax ^ 

T r (X) = /"r(A)+/3 r (A) + Cr(A) + (l + ^)min( 7 (A),^) if2m-2r>ujX 
1 X 2 /uj Otherwise 

The intuition behind this definition becomes more clear when we will use it for 
concentration inequalities in section 5.5. Note that with the above definition 
since Xi = 2A^_i the following is true 



T r (A;_i) < T r (Xi) < 8T r (A 4 _i) 



(20) 



In order to prove (15, 16) we need to partition A, B into smaller pieces. For that 
let us repeat steps 2 and 3 of partitioning in section 5.3 more carefully. 
Define subsets A C A 1 C . . . C A L C S*(M) as follows: 

Ai = {TV e S**(M) | H?{N r ) - ij) r (M) < T r (Xi) : V0<r<m-1} 

and = S*(M) \ LI-LqA- Then we have: 

A = A 00 U (uf =1 Ai \ At-!) 

Let K be an integer such that 2 K - 1 < (logn) 2+s + 1 < 2 K . Define sets B Q C 
Si C • • • C B K C A by: 

= {/V e A | &(Af r ) < V : V r > (2m - cjA )/2} 

Hence C = Bq. Note that T r 's are chosen such that T r (Ao) = Aologn : Vr> 
(2m — wAo)/2 and by corollary 1 for all r > (2m — uj\ )/2 : ip r = o(l). Thus for 
all such r and all elements of A 

& r < A log n + ip r < 2 K 

this shows: A = uf= Bj and B = uf =1 Bj. 

5.5 Proofs of (15), (16) and (17) 

Consider the following four lemmas: 
Lemma 10. For all 1 < i < L 

(a) P„(A i \^-i)<e-^) 

(b) f(N)<e°^ : WV Gij\ 

Lemma 11. ; 

(oj P„(A 00 ) < e - cd »-i°g™ 
ft) /(A/ - ) < e 4d »- lo s" : V7V e 

Lemma 12. For all 1 < j < K 

(a) F u (Bj \ Bj_!) < e -«(2 j/2 iog«) 

(b) f(M) < e°( 23j/4 ) : V7V e Bj \ B^ x 
Lemma 13. For all Af eC, f(M) < 1 + o(l) 

Now it is easy to see that (15) is result of Lemmas 10, 11. Note that by definition 
of K we have 2 K ^ < logn so 2 3j / 4 < 2-?/ 2 logn. Thus one can deduce (16) from 
Lemma 12 and finally (17) is consequence of Lemma 13. 

Proof of Lemma 10 uses Vu's concentration inequality but for other three 
Lemmas typical algebraic and combinatorial bounds are sufficient. Section 5.5 
is a quick introduction to Vu's concentration inequality. Section 5.5 gives proofs 
of Lemmas 10(a) and 11(a). In section 5.5 part b of the same lemmas will be 
shown. Finally sections 5.5 and 5.5 prove Lemmas 13, 12 respectively. 



Vu's Concentration inequality Proofs of Lemmas 10(a) and 11(a) use a 
very strong concentration inequality proved by Vu [48] which is a generalized 
version of an earlier result by Kim and Vu [32]. Consider independent random 
variables t\, t 2 , . . . ,t n with arbitrary distribution in [0, 1]. Let Y(ti,t2, ■ ■ ■ , t n ) be 
a polynomial of degree k and coefficients in (0, 1]. For any multi-set A of elements 
ti, t 2 , . . . , t n let &a Y denote partial derivative of Y with respect to variables in 
A. For example if Y = ti + t\t\ and A = {t 1 ,t 1 }, B = {t 1; t 2 } then 

d A Y=^Y = 6 tl tl , BBY=J^Y = Ofa 

For all < j < k, let Ej(Y) = max\ A\>j E(8aY). Define parameters Ck,dk 
recursively as follows: ci = 1, d\ = 2, Cfe = 2fc 1 / 2 (c / t_i + 1), dk = 2{du-i + 1). 

Theorem 4 (Vu). Take a polynomial Y as defined above. For any collection of 
positive numbers So > £\ > ■ ■ ■ > Sk = 1 and A satisfying: 

- £j > Ej(Y) 

- £j/£j+i > A + 4j logn, < j < k - 1 
the following holds 

P (\Y - E{Y)\ > c fev /A£ fi) < d k e- x /\ 

Proof of part (a) of Lemmas 10 and 11 In this section in order to show 
part (a) of Lemmas 10 we actually show the stronger property: 

P u (^i_i) < e- f2(As) (21) 

This property combined with A^ > cd 2 logn proves part (a) of Lemma 11 too. 
From (20) we have: 

4lC{*r-*r>^} 

so to show (21) it is sufficient to show the following two lemmas. 
Lemma 14. For all r such that 2m — 2r > wAj. - 
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&(K)-ipr > i(a I .(A 4 )+/3,.(A J ) + Cr(A J ) + (l + ^)min( 7r (A l ), u r )^j 



< e" 



Lemma 15. For any r such that 2m — 2r < wAj 

Now we focus on Lemma 14. For each variable A r , A r , <F r denote their analogues 
quantity in G Pr by A Pr , A Pr , >F Pr . 



Lemma 16. Vr: 

F Pr ({\E(G Pr )\=r})>± 

Proof, let f(m, r) = F Pr ({\E(G Pr )\ — r}) then it can be seen that: 
/(m,r + l) (l + l/rf 



<1 Vr<(m-l)/2 



< e 



-fi(Ai) 



so minimum of f(m, r) is around r = m/2 and for that using Stirling's approxi- 
mation we can get f(m, r) > > i ■ 

By Lemma 16, with probability at least 1 /n, G Pr has exactly r edges and since 
Xi S> log n to prove Lemma 14 we need to show: 

P (* Pr -A>\ (ar(\i) + Pr(Xi) + Cr(Ai) + (1 + ^) min( 7 r(A i ), I/ r ))) 

(22) 

From now until end of the proof we fix r, i and remove all sub-indices r, i for 
simplicity. Note that expectation with respect to randomness in G p is the same 
as expectation with respect to uniform distribution on S{M). Therefore (22) is 
result of the following: 

Lemma 17. : 

(i) P(A pA - E(A pA ) > a/8) < e- n W 
(ii) P (A Pi2 - E(A p , 2 ) > min(/3 + 7, [3 + u)/8) < e-"^ 
(Hi) P - E( ^- l) s+ E( ^' 2) > C/8) < e- f2(A) 

M P (%r - > %f min (/5 + 7, /3 + i/)/8) < e - f2 < A ) 

Proof. : 

(i) Same as Kim-Vu's proof for each edge e of G consider random variable t e 
which is equal to when e is present in G p and 1 otherwise. These t e 's will 
be i.i.d. Bernoulli with mean q. Now note that: 

a p ,i = ^2 X! tet f 

and E(Ap tl ) = E« C^)"? 2 < md max q 2 . Next for each t e , E(d tc A Pil ) = 
^(S/ : /ne^0 tf) < 2(d max — l)q < 2d max q. And any partial second order 
derivative is at most 1. So we have: 

Eo(A>,i) < max(md max q 2 ,2d max (7, 1) , E 1 (A pA ) < max(2d max q, 1) andE 2 (Zip,i) = 1 

Set £0 = 2md max g 2 + 2A 2 , £1 = 2d max q + A, £2 = 1 then using A » logm 
conditions of Theorem 4 satisfy. On the other hand if c in definition of a is 
large enough we have C2\J£q£\ < a/8 hence by Theorem 4 part (i) is proved. 



(ii) We need to prove the followings 



(^ 9 -E(4,, a )>^)<e- n M 



(23) 



A p>2 - E(Z\ p , 2 ) > i±Ji) < e- n ^ (24) 



Consider the same random variables t e from part (i). Let R be set of all 
paths of length 3 in G. Then 

\,i = ^2 tet a( 1 - l f)= ^2 tet 9 ~ ^2 t ^ t f t 9 

{eJ,g}eR {e,f,g}£R {e,f,g}£R 

Let Yi = Y,{ e j, g }eR tet 9 and ^ 2 = S{ e ,/, s }ei? *e*/V Similar to part (i) we 
have: 

Eo(Yi) < max(™< ax g 2 ,2< ax(Z , 1) , E 1 (Y 1 ) < max(2< ax<Z , 1) andE 2 (Y 1 ) = 1 

Now set £ a = 2md 2 aax q 2 + 2A 2 , E\ = 2d 2 nax q + X,£ 2 = 1. These satisfy 
conditions of Theorem 4. Again by considering c large enough we have 

PflYi - E(Yi)| > /3/8) < e^ 2(A) (25) 

and for Y 2 

E (Y 2 ) < max(md 2 nax g 3 ,2d 2 nax ( ? 2 ,2d max q, 1) , E 1 (Y 2 ) < max(2d 2 nax q 2 , 2d max q, 1) 

E 2 (Y~ 2 ) < max(2d max g, 1) and E 3 (F 2 ) = 1 

Set £ = 2m< ax g 2 + 3A 3 , £ 1 = 2d 2 max q 2 + 2X 2 ,£ 2 = 2d max q + \£ 3 = 1 to 
get: 

P(\Y 2 - E(Y 2 )\ > 7/8) < e~ n W (26) 

Combining (25,26) together (23) is proved. 
Finally (24) is result of (25) and the following 

A p , 2 -E(A Pt2 ) < |yi-E(Y 1 )|+E(Y 2 ) < \Y 1 --E(Y 1 )\+m<% Dm tf i < |Fi-E(yi)|+^/8 
(iii) We will show the followings: 

P( Sr " ^toT " c i<ax9VA(A + m,)) < e-»W (27) 

P (^ _ E(^) > C -±^L^\(\ + mq )) < e~ n ^) (28) 
8m 8m 4m 

Note that by making c in definition of £ large enough (27) and (28) together 
give us (iii). 



First we show (27). Note that one can write: 

,. 4- d„, 

-t e 



Ap.i d u + d v 



2c?max , n^ev^in 2<i max 

e=(u,v)eE(G) 

which is a polynomial with coefficients in (0, 1]. As before 

E (/l p ,i/(2d max )) < max(m?,l) , E 1 (A Ptl /(2d max )) < 1 
now set £ a = A + mq, £\ = 1. so we get 

F(\A pA /(2d max ) -E(yl Pil /(2d max ))| > c lv /A(A + m g )) < d ie "^ A ) (29) 



By Lemma 8(h) yl Pi i < 2m d max q, so /l 2 ;1 — E(/l P: i) 2 > 8cito d^^q^/ A(A + mq) 
yields yl p ,i/(2d max ) - E(/l p ,i/(2d max )) > c ly /\(\ + mq). Now using (29) 
equation (27) is trivial. 
Proof of (28) is much easier. Start with 

A P ,2 _ sr^ d u +d v 
2d 2 " , V , 2d max 6 

e=(u,u)e£(G) max 

use 

E (vl p ,2/(2< ax )) < max(m <Z ,l) , E 1 (yl p , 2 /(2d 2 naJC )) < 1 
so with the same £o,£i we get 

w A p , 2 > CiVx{x + mq)) < c - nW 



2^max 2d ml _ 

which yields (28) easily, 
(iv) This case is treated exactly the same as (ii) because we have the following 

A P ,3 d u d v 

-j2—= -j2—t e t g (l-tf) 

maX {e,f,g}£R, e=(u,v) max 

and everything is as in (ii). 



Proof (of Lemma 15). Using Lemma 8(iii) and definition of \P from & p > X 2 /ui 
one can get 

A p > A p 

A 2 

> md 2 max q 2 

UJ 

A^ c?^,„ v .ccJ^ ,„ „ , 

> 2^ (30) 

u 4m 
X 2 



where (30) uses 2mq = 2m — 2r < uo\ and (31) holds since <i^ ax a> 3 <C m. 

Since 2m — 2r is small then G p is very dense so we consider its complement 
G q which is sparse. Let N n (u) — N(u) U {u}. Then using 

A Pr <J2d Gq ( u ) E d G» 

u v£N Q {u) 

from A p > X 2 /2ui one of the followings should hold: 

(a) G q has more than w 2 A/4 edges 

(b) For some u, £„ £J v o(u) d G q ( v ) ^ X / Uj3 

If (a) holds since 2mq < uj\: 

, , lo 2 X , , / m \ h^a 

F(G«j has more than edges) < ^ a) 2 A Jg 4 



" ( ^ 2 A~ ) 



< e -^(log^-l-log2) = e -fi(A) 

and if (b) holds note the number of edges of G that contribute in J2v<en (u) d G q {v) 
is at most d^ ax and each one contributes at most twice. Hence 

P( E d Gg (v)>\/^)< ( d ™Aq^ 

veN„(u) ^ 2^ / 

A 

m 

= e-^-Oog^-^sf^max^ 4 )-!) < e -n(^\ ogm ) = e -n{\) 

Note that we need 5 in definition of ui to be small enough such that log m^> u 3 
and actually for S < .1 this is true. I 



Proof of part (b) of Lemmas 10 and 11 Note that: 



f(M)= J] 1 + 



r=0 



( 2m " 2r ) - &{K) 



and since <^(jV r ) < (1 - T/4)( 2m ~ 2r ) for TV e S**(M): 



2 

m— 1 



Proof (of Lemma 10 part (b)). Using 1 + x < e x we only need to show: 



m— 1 



h (2"»"2r) 2 " ° (A) 



For simplicity let 3 (r) = max ^t^' 0) . Note that < 5 (r) < 1 so E 2 A ^ 2 , =2 5(0 = 
o(A) hence we only need to show J22m-2r=\/w 1 / 2 <7( r ) = °(^) • Also note that 
numerator of g(r) is at most T r (A) so by definition of T r (A) and using ^ -C 1 
we have: 

2m— 2 wA .2 

2 (2m-2r) 2 . 9 (r)< £ - 

2m-2r=A/u 1 /2 2m-2r=A/u 1 /2 



+2 ^ (a r (A)+/3 r (A)+Cr(A) + // r ) 

2m— 2r— cjA 

2m- 2 

+2 Yl («»■(*) + ^( A ) + ^( A ) + >( A )) 



2m-2r=wA 2 

A series of elementary inequalities will be used now to bound these three sums. 
We will use q r = ^ S ^ L : 

2 ^ 2 (Am< axgr 3 )V 2 _ X^d max 2 ^ 2 1 



\1/2j /-2m -, \V 2 W 



m ~ 2 XCW n 2 ) 1 / 2 \J 1/2 2m ~ 2 1 Ar/ 1/2 

fOm - r \2 ~~ O m l/2 9m - 9r- ~~ ^ rr.1/2 ° g 



(2m -2r) 2 2m 1 / 2 ^ 2m - 2r m 1 / 

2m— 2r— 2 v 7 2m— 2r— 2 

2»i-2 ,,3 , ni/2 \3/2j1/2 2m-2 A 3 / 2 ^ 1/2 

Et, A «max9rj _ A ' Umax 1 _ ^-)/ A "max s 

„ (2m-2r) 2 ~ (2m) 1 / 2 ^ , (2m - 2r) 3 / 2 ~ 1 ^ J 

2m— 2r— 2 v 7 v 7 2m — 2r— 2 v 7 

2 m -2 ^ 2 /'2m 

^ (2m-2r) 2 - X l > = ° (A) 

First three of last four equations imply next three equations using the same 
reasoning: 

2 ^ 2 (Am< ax a l/2 _ n( ^ 2 d 2 
2^=2 (2— 20 2 1 aM j 

2 v^ 2 A(md 2 na xg 2 ) 1/2 _ Ad max 
2- (2m -2r) 2 " ° [ m V 2 l ° gm) 

2m— 2r— 2 v 7 



2 ^ 2 (A3< axgr )V2 _ A^d ma 

Furthermore 

2m-2 ,,o ,9 ,a/o , 2m-2 



v (AW 2 ^)^ _ A 3 / 2 rf max 1 = Q( A 3 / 2 rf 



E A 2 (j ma xgr _ ^ A 2 (i max logm 
(2m - 2r) 2 1 2m J 

2m-2r=2 v ' 



2m-2 



s „ p^*? - o(A3 L» ^ = 0( ^> = ° m ,32) 



2m-2r=uj\ 2 

2m ^ 2(2m _ 2r)2 - 2ro ^ 2 8TO 2 -0( m2 ) (33) 
Remark 1. All previous equations are o(A) because A < A^ = 0(d^ nax \ogn) and 

tax = o(m3- T ). 

Note that we are applying A + B < \[A + \f~B~ to cn r , [3 r , j r ,Cr- B 
Proof (of Lemma 11(b)). Similar to proof of Lemma 10(b) we will show 

/(A0 - T JL +1 S (l + 3/T } 

2d 2 x4>ax ™^|a X ^ 



/O J2 \ "max "max 

<( V ) • n ( i+3 / t 



(2m-2r)2 

m ~i» r) j2 j2 

<(2< ax ) d -- II ( 1 + 3 / T ^Y r +0{ ^ (34) 

r=0 

< e <axl0 S 2<i ma x+E- <ax + 1 

< g d max ( log 2 + 4 'OS dmax + log m + 0(l)) 

< e 4d ma xi°g" (35) 
Where (34) use Lemma 8 and (35) uses m < nc? max /2 and d <C m 1 / 3 < n 1 / 2 ■ 

Proof of Lemma 13 By definition of C : Y^2m-2r=2 ff( r ) = ^. We only need 
to show if \P(Af r ) - &{N r ) < T r (X ) for all r such that 2m - 2r > u\ then: 



m 

2m—2r—uj\o 



So it is sufficient to prove that 



(2m -2r) 2 [> 

The proof is similar to proof of Lemma 10 (b) with a slight modification. Instead 
of (32, 33) we use 

2m-2 .3 ^oo .3 

2m— 2r— wAq 

E m ^maxffr (2m - 2r)rf 2 nax _ . ^a,.^^ _ 

. ,(2m-2r) 2 ^ m 2 ^ m 2 J M 

2m— 2r— 2 v 7 2m — 2r— 2 

and in all others letting A = A they become o(l). ■ 
Proof of Lemma 12 

Proof (of Lemma 12(a)). We have 2m — 2r < lu\ <C (logn) 2 which means 
proving the bound only for one r is enough. Similar to proof of Lemma 15 from 
&p > 2 J 1 one gets A p > 2 J ~ 2 so one of the followings hold: 

(a) G q has more than 2 J / 2-2 edges 

(b) For some u, E ve N (u) <h q {v) > V^ 1 

and rest of the proof will be exactly as in Lemma 15 ■ 
Proof (Proof of Lemma 12(b)). By definition of Bj 

(2m - 2r) 2 



E 9(r)< £ l^^-OiV) 

2m-2r=2 2m-2r=2 



5.6 Proofs of (18) 

From Lemma 14 for all r : 2m — 2r > ui\ the following holds 

F(\V r - i/> r \ > a r (X ) + /3 r (A ) + (1 + d 2 m ^J Am) lr {\ Q ) + Cr(A )) = o(l) (36) 

Let Af be an ordering with \\P r — Vv| < a r (X ) +/3 r (A ) + (1 + d 2 nax /4m)7 r (A ) + 
Cr(Ao) for all 2m — 2r > uj\ a . Then it follows: 

2m-2 



f(\r\-> TT (\ ^/^ a r(A ) +/3 r (A ) + 7r(A ) + Cr(A ) > \ 

m - JL* v ~ iM ^? > 



2m-2r=2 



In section 5.5 it is shown that § ElT-l=^ ^±MM±2r^±M^ = (i). 
Now one can use 1 — x > e~ 2x when < x < 1/2 to see the first term in the 
above product is 1 — o(l). The second term is also 1 — o(l) because a corollary 

of Lemma 8 is that ip r = O (2m — 2r) 2 - l ^* iL and since wAqgP = o(m). These 
together with (36) finish proof of (18). In fact it shows the stronger statement: 
E(/(A0lsW > l-o(l). ■ 

Remark 2. Proofs of this section and section 5.5 yield the following corollary 
which will be used in section 5.7. 

Corollary 2. Setting c in definition of Al very large: 



E exp 



m — 1 



T 2 

r=0 



maxQ^A/;) -$(M),0) 
(2m - 2r) 2 



o(l) 



(37) 



Bounds of sections 5.5, 5.5 and 5.5 show that contribution of the sets Ai \ Ai_\ 
and Bj \Bj_i are all o(l) and contribution of C is l + o(l). Contribution of 
also is o(l) by taking the constant c large enough. 



5.7 Proof of (19) 

In this section we deal with those orderings Af for which the condition 

W)<(l-r/4)( 2m - 2r ) (*) 

is violated for some r. If this happens for some r then from Lemma 8(iii) and 
d 4 = o(m) 



ArW > >W)-^(2m-2r) 2 > ^ r )-r/^ g ^ > (l-r/2)( 2m g 2r 



On the other hand using Lemma 8(i) A r (Af) < dmax( - 2 2 m 2r ^ . So for 2m — 2r > 
3^*2 + 1 we have A r (Af) < (l-T/2)( 2m ~ 2r ). Thus condition (*) is violated only 

for r very close to m. Let S'j(.M), i = 1, . . . , 1 ^J 2 + 1; be the set of all ordering 
Af for which (*) fails for the first time at r = m — i. We will use = °(1) 

to prove (19). It is enough to show: 

E(/(^)l Si )<(l + (l))-L 

Note that C m ~ 2r ) ${M) = E«j) 6 Br ^^(l - %f ) 

because at step r there should be at least m — r suitable edges to complete the 
ordering TV. Hence using d = o(m3~ T ): 

/2m-2r\ ,4 

-5 \ 2 ; < 2m - 2r - 1 + 0( — ) < 2m - 2r 

/2m- 2r\ ,r,/*r \ — ^ ™ / — 



, r )-*W) 



therefore, 



n 



( 2m " 2r ) - 



< 2H\ < 2i l 



and since i is the first place that (*) is violated: 



11 (2m - 2r) _ nK) - exp 



5 yC max(!?(jV r ) - ^(A^ r ),0) 



r=0 



(2m - 2r) 2 



so, 



m-1 /2m-2r\ _ kt x 

Now using Holder's inequality: 



r 5 y^ 1 max(^(jV r ) - &(K), 0) 

r=0 



(2m - 2r) 2 



E(/(#)1 S ,)<M l s , exp 



r 5 1 max(>P(Af r ) - <P(K), 0) 



E 

r=0 



< 2z 4 E((1) Si ) 1 ~ t/2 E 



exp 



(2m- 2rf 



10 %A 1 nuucQP(MQ -ffCA/"r),0r ^ 

_ ^ 

r=0 



(2m- 2r) 2 



But using Corollary 2 the second term in the above product is 1 + o(l) and we 
only need to show: 

2zT(^) 1 -^ 2 < (l + (l))J_. 

Let r = m — i and T(u) = r^, r (u) be the set of all neighbors of u in GV r - Note 
that: 

A r (M) = 1 -j2 di u ) E k-w 

« ver(u)u{u} 



and 



2m - 2r\ 1 



E^E^- 1 — ) 



now A r {N) > (1 - r/2)( 2m - 2r ) > (1 - r)( 2m - 2r ) implies there exist a vertex u 

(r) 

with d u > such that: 

2 «-1«=,)>(1-t)£«-1„=„) 

t)e-r(«)u{«} f 



or 



E ^^E^- 1 — ) 

t)^r(tt)u{«} f 
< r(2m - 2r - 1) < 2ri 



(38) 



Any of last i edges of Af that does not have its both cndpoints inside -T(it) 
contributes at least once in the left side of (38). So there are at most 2ri of such 
edges. Let j = d u — \r(u)\ and i be number of edges entirely in r(u). Then 



we should have j > 1 and I > (1 — 2r)i. The probability that > and 
E^r(u)u{u} <%> — for a fixed vertex u is upper bounded by: 

(t)( ( Y ) )r^ ( ? 3) ) 



hence: 



Now using 



E 

j>l, e>(l-2r)i 



m) < E E 

u j>1, £>(l-2r)i 



(T) 



< 



v 2 , 
I 



< 



m-d u - ( V) 



< 



(?) 

(^/2) £ 
i-j-e 



(i-j-ey. 



and, as i = 0(d 2 ) = o(m 1//2 ), 



= + 



hence: 



P(Si) < 



< 



< 



< 



< 



4 (dj/^Y wLizl 
j\ t\ (i-j-ey. 



(d u / m y(dl/2mYi\ 

mi-j-ty. 



i + «(D)E E 

u j>l. e>(l-2r)i 

i + «(D)E E 

u j>l. i>{l-2r)i 

1 + o(l))2« ^(d„/m)(4/2m)( 1 - 2 ^ i 
1 + o(l))i^ ^(4/2m)( 1 - 2T ) i 2 i 



2-ri 



1 + o(1))j2 24 / 3 ^^(^ 
1 + o(1))2i2 24 / 3 



2 \ (l-2r)i 



m 

(1-2t)» 



(39) 
(40) 
(41) 



here (39, 40) use t < 1/3 and (°) < 2 a . Equation (41) uses d k u = T, u ~ G v( d u~ 1 + 
d^ 1 ) < 2md^. Now we can use i < and (i max < mi~ T to get: 



. ...-/4-4T 

2i i P(S* 4 ) 1 ~ T/2 < (l + o(l))4i 



6 C 

,l-2r 



< (1 + o(l))4i(6 

< (l + o(l))m _Ti 



m 

3t+4t 



6 Probability of Failure of Procedure A 

Before giving the proof consider the following remark. 

Remark 3. Without using any of the known formulas for \£(d)\ Lemma 1 gives 

an upper bound for number of simple graphs with degree sequence d. If cZ max = 
0( m i/4-r) thcn 

nm /2m—2r\ 
,,„,,,, . r=0\ 2 I 

1 Wl - minr=i*i 

In this section we will show that the above inequality is in fact an equality. 

In this section we will show that probability of failure in one round of Procedure 
A is very small. First we will characterize the degree sequence of the partial graph 
generated up to the time of failure. Then we apply the upper bound of Remark 
1 from the beginning of Section 4 to derive an upper bound on probability of 
failure and show that it is o(l). 

Lemma 18. If Procedure A fails in step s then 2m — 2s < c^ ax + 1. 

Proof Procedure A fails when there is no suitable pair left to choose. If the 
failure occurs in step s then number of unsuitable edges is equal to total number 
of possible pairs ( 2m ^ 2s ) • On the other it can be easily shown that the number 
of unsuitable edges at step s is at most rf 2 nax (2m — 2s)/2 (see Corollary 3.1 in 
[45] for more details). Therefore 2m — 2s < e^ lax + 1. ■ 

Failure in step s means there are some Wj's which have unmatched mini- vertices 
(d\ s ^ 0). Let us call them "unfinished" W^s. Since the algorithm fails, any two 
unfinished Wj's should be already connected. Hence there are at most d max of 
them due to Vi : \Wi\ — di < c? max . The main idea is to show this is a very rare 
event. Without loss of generality assume Wi, W2, ■ ■ ■ , Wk are all the unfinished 
sets. Above argument shows k < d max and by construction k < 2m — 2s. The 
algorithm up to this step has created a partial matching M s where graph Gm s 



is simple with degree sequence — (d\ — d\ s , . . . , d^ — a£ , dfc+i, ■ • ■ , d n ). Let 
A (,) ( S ) denotes the above event of failure. Hence 



^max + l niax(d max ,2m- 2s) n 

P(fail)= E E E ■vv,...,-^ 



(42) 



2m-2s=2 



fc=l 



The following lemma is the central part of the proof. 



Lemma 19. Probability of the event that Procedure A fails in step s and vertices 
Vi, . . . ,Vk are the only unfinished vertices; i.e. d^ ^ i = 1, • • • , k is at most 



(1 + 0(1)) 



fe(fe-i) n fc A 

max 1 li = i u i 



m(>)(2m) 2 



2m- 2s 

d {s) d (s) 



Proof. By our notation the above event is denoted by A j S ) 



Note that 



the above discussion shows the graph Gm s should have a clique of size k on 
Vi,...,Vk- Therefore number of such graphs should be less than |£(4 )| where 
4 s) =(d!-4 s) -(fc-l), . . . , d fc -4 a) -(fc-l), ■ • • , d n ). Thus P A (A d ( S , 

is at most |£(4^)| Pa (GmJ- On the other hand we can use Remark 1 to derive 
an upper bound for |£(4^)| because m — s and k are small relative to m and 
the following can be easily shown d max = 0([s — {^}] 1 ^ A ^ T )- The result of these 
gives 



(2s - k(k - 1))! cxp [-A(4 S) ) - A 2 (4 s; ) + o(l) 



p A(V,-,4 s,) 



< 



Pa(G Ms ) 



The next step is to bound Pa (G\m s )- We can use the same methodology as in 
the beginning of Section 4 to derive 



Pa(GaO 



nr=i* 



n- =1 



E 



M s es(M B ) 



s! exp 



( 2 V r )"^(G ! A < .) 



s! exp — A(d) + 
s 



S ' r=0 I 2 J 



Now using the following simple algebraic approximation 

!JA(J)+^A 2 (J)-A(4 S VA 2 (4 S) ) = O (A(d) U(d) - A(4 S ))1) = 0(fe = 0(1) 



the following is true 

p (a s < Mi) Hk 1)]! (2m - 2s)\ s\ 2® flti * 



- [s- (*)]! (2m)! nti [W^K* - * - < (S) + 1)!] 
<e*> — U-i. (43) 

n^vn^^+i)^ ■■■■• d * y 

( fc ) 

Next using m — s = 0(d 2 nliX ) and fc = 0(d max ) we can show n)=i (2 s — 2j + 1 ) > 

m(*) and (l/m 2m - 2s )H 2 ™ 2s+1 £ > e -°«™/ m \ These two added to equation 
(43) finish proof of Lemma 19. I 

Now we are ready to prove the main result of this section. 

Proof (of Lemma 2). First we show that the event of failure has probability of 
o(l) for the case of only one unfinished vertex; i.e. fc = 1. Lemma 19 for k = 1 
simplifies to ¥ A (A dM ) = O ((^) 2m ~ 2s ) therefore summing over all possibilities 
of fc = 1 gives 

E E^a(V) = ° E =0(^ = 0(1) 

2m-2s=2 i=l \ 2m-2s=2 

For fc > 1 we use Lemma 19 differently. Using dmax -1 ^ /mW < d^^/m and 
equation (42) we have 

(a) 



P(fail) < o(l) + =5* ^ 



max(d max ,2ra-2s) „ fc , 

E\ - TT J d f 2m - 2s 

! ' l '^ax k=2 i u ...,i k =li=l ^1 '■■•' d fc 

"' 2^=2 ^ W 2 ™- 2 ' 

Now note that the double sum (a) is at most (di + . . . + d n ) 2m - 2s = (2m) 2m ~ 2s 
since J2i=i = ^ m ~ 2s. Therefore 

P(fail) < o(l) + e °( 1 )^i ^ i = 0(^252L) = (i). 

m 2m-2s=2 m 



7 Running Time of Procedure A 



In this section we prove Lemma 3. 



Proof. Note that for regular graphs the quantities (1 — didj/Am) are fixed for 
all pairs (vi,vj). Therefore step (2) of algorithm A is equivalent to (ignoring the 
random variable P for now): 

(2a) Choose (with replacement) vertex m G V and then vertex Vj € V with 
probabilities proportion to di, dj respectively. Accept the pair (vi,Vj) if 
i 7^ j and (vi,Vj) E. Add (vi,Vj) to E and reduce each of dj, dj by 1. 
Otherwise reject the pair (vi,vj) and repeat step (2a). 

Stegcr-Wormald [45] have used the above implementation and showed the run- 
ning time of Procedure A for regular graphs is 0(ncP)=0(m d max ) per trial. 
Now for general graphs we use the following modified version: 

(2b) Choose (with replacement) vertex Vi <G V and then vertex Vj € V with 
probabilities proportion to di , dj respectively. Toss a coin which comes head 
with probability 1 — didj/Am. Accept the pair (vi,Vj) if coin shows head, 
i 7^ j and (vt,Vj) ^ E. Add (vi,Vj) to E and reduce each of di, dj by 1. 
Otherwise reject the pair (vi,vj) and repeat step (2b). 

It is not hard to see that steps (26) and (2) choose any suitable pair with 
equal probability. This is because in each trial of (26) probability of choosing a 
suitable pair (vi,Vj) is proportion to didj(l — didj/Am). Moreover probability 
that one trial of (26) does not produce a pair is fixed for all pairs (vi ,Vj). 

Since d max = Oim v l^~ T ) we have 1 — didj/Am > 1/2. Thus expected running 
time of (26) is at most twice the expected running time of (2a). The rest of the 
analysis is just repetition of Steger-Wormald's argument in page 19 of [45] for 
regular graphs. Their argument only uses the fact that all degrees are at most 
rfmax which can be easily generalized to general degrees. We have included this 
generalization after the proof. 

Before finalizing the proof we still need to explain how p t j is calculated for 
any suitable pair (i,j) that is chosen in step r. This should be done in order to 
update random variable P correctly. Note that using calculations of section 5.1: 

(1 - d t d 1 /Am)d ( [ ) d { [ ) 

rp. ■ — -L 

13 /2ro-2r\ _ a _ Al,i +Ar < 2 , A r,3 ' 
V 2 ) 8m + 4m 

therefore if we know A r and each A r ^ at step r of the algorithm then calculation 
of pij will be easy. At the beginning of the Procedure A we have 

u v 7 

which can be calculated with 0(n) operations. And at each step it can be updated 
via 0(d max ) operations. This is because by choosing a pair at step r: 

Am-i - A- = dt ] d^ - ]T d^ + £ 4 r) 



and each summation has at most <i max elements. We will explain a data structure 
below that shows how exactly every summation can be calculated with 0{d max ) 
operations. Similar argument shows that each A r ^ : r = 1, 2, 3 can be calculated 
using 0(n) operations at step and can be updated with 0(<i max ) operations 
at any step. Thus the expected running time of the new implementation of 
Procedure A is 0(md max ) per trial for general degree sequences. Now using 
Lemma 2 the expected running time of the Procedure A is of 0(mcZ max ). ■ 

Generalization of Steger-Wormald's Analysis of Running Time. Steger-Wormald 
(SW) implementation is in three phases and uses the configuration model. 

First phase is to have all mini-vertices in an array L where all matched 
mini- vertices are kept in the front. It is also assumed that members of each 
pair of matched mini-vertices will be two consecutive elements of L. There is 
another array / that keeps location of each mini- vertex inside array L. Then 
two uniformly at random selected elements of L can be checked for suitability in 
time 0(d max ) using I. 4 We can also update & r using I via 0(d ma ,x) operations. 
Repeat the above till a suitable pair is found then update L and /. 

Phase 1 stops when number of remaining mini- vertices falls bellow 2e^ ax . 
Hence using Corollary 3.1 in [45], throughout phase 1 number of suitable pairs 
is more than half of total number of available pairs. Therefore expected number 
of repetitions in the above process is at most 2. This means expected running 
time of phase 1 is 0(md max ). 

Phase 2 is when number of available mini-vertices is less than 2d^ ax and 
number of available vertices is at least 2d max . In this phase instead of choosing 
mini- vertices, choose a pair of vertices of G (two random set Wi,Wj in configura- 
tion model) from the set of vertices that are not fully matched. Repeat the above 
till i,j is not already an edge in G. Again the expected number of repetitions 
is at most 2. Now randomly choose one mini- vertex in each selected W{. If both 
of the mini-vertices are not matched yet add the edge, otherwise pick another 
two mini- vertices. Expected number of repetitions here is at most 0(c^ ax ) and 
hence expected running time of phase 2 is at most 0(d 4 nax ). 

Phase 3 is when number of available vertices (Wj's) is less than 2<i max . We 
can construct a graph H, in time 0(c^ ax ), that indicates the set of all possible 
connections. Now choose an edge of H uniformly at random and accept 

it with probability didj / d^^. Again expected number of repetitions will be at 
most 0(e^ ax ). Update H in constant time and repeat the above till H is empty. 
Therefore expected running time of phase 3 is also 0(d^ ax ). 

Hence the total running time for d max = 0(m 1 / 4 ~ T ) will be 0(md max ). 

8 Bounding the Variance of SIS Estimator 

In this section we will prove two variance bounds from Section 4. We will borrow 
some notations and results form Section 5. 

4 Note that in our modification the pair is accepted with probability 1 — did j /4m 
depending to which Wi,Wj they belong. 



8.1 Proof of Equation (3) 

It is easy to see that equation (3) is equivalent to Ea(N 2 ) /Ea(N) 2 < 1 + o(l). 



i \ 2 ... i 



On the other hand analysis of Theorem 1 gives us the estimate 

nr= 1 (i+«(i))[( 2 V r )-^] 



Therefore: 



ml ^W^r 

^a(N 2 ) _ J2gJ2m ( m \yj v a (m) 
E A (iV) 2 ~ \C(d)\ 2 



J2g Em rir=o 



(""-"•) -g r (A<) 



m\\C{d)\ 

E G E(g(A^)) 

|£(J)| 



(44) 



where g(Af) = Yi"=o " (%™-^ r ^^ > arL d E is with respect to uniform distribu- 
tion on set of all ml orderings, S(A4). We will show that for all graphs G € C(d) 
the following holds: 

E(s(A0)< l + o(l) (45) 
Note that equations (44, 45) finish the proof. Thus we only need to prove (45). 

Proof of Equation (45). Before starting the proof it is important to see that 
g(N) = /(Af)^ 1 and the aim of Section 5 was to show E(/(AT)) = 1 + o(l). In 
this section we show that concentration results of section 5 are strong enough to 
bound the variance of g{N) as well. 

Recall definitions for variables \ and T(\i) from Section 5. Here we will 
consider a different partitioning of the set S(M). Define subsets F C F\ C 
. . . C F L C S(M) as follows: 

Fj = {TV e S(A1) | Vr - ^(A^) < T r (Aj) : V < r < m - wA;/2} 

and Foo = 5 (At) \ L)f =0 Fi. The following two lemmas are exact analogous to 
Lemmas 10, 13. 

Lemma 20. For all 1 < i < L 

(a) P u (F t \F t ^)<e-^ 

(b) g(N)<e°^ : VW 



Lemma 21. For all M e To, g(M) < 1 + o(l) 



Proof of these Lemmas is exactly similar to proofs for Lemmas 10 and 13. We 
only need the following extra piece 

2m-2r=2 2 

Then for Lemma 20 we use " A ^" ax = o(A) and for Lemma 21 we use " A °^ max = 
o(l). Combination of these two lemmas gives K(g(J\f)) < 1 + o(l). ■ 



8.2 Proof of Equation (4) 

Same as before we will use lemmas from Section 5. But the main technical point 
in this section is a new result which exploits combinatorial structure to obtain 
a stronger bound for controlling the variance. 
Equation (4) is equivalent to 

For that we start with a bit of algebraic calculations 

E B (P 2 ) _ ml Em f a(M) 2 _ E(/(AQ 2 ) 
E B (P) 2 Pa(G) 2 E(/(.A0) 2 

Therefore all we need to show is E(/(W) 2 ) = 1 + o(l). 

Consider exactly the same partitioning of the set S(A4) as in Section 5. It is 
straightforward to see that Lemmas 10, 11, 12, 13 give us the following stronger 
results as before: 

E(f(M) 2 l A ) = o(l) 
E(/(.A0 2 1 B ) = 
E(/(A0 2 l c )<l + o(l) 

and the only missing part is the following 

WW 2 l S *(M)\s*(M)) = o(l) (46) 
which we will prove by exploiting combinatorial structure. 

Proof of (46) In this section we deal with those orderings N for which the 
condition 

W)<(l-r/4)( 2m " 2r ) (*) 



is violated for some r. If this happens for some r then from Lemma 8(iii) and 
d 4 = o(m) 



A r (N) > y{N r )- — {2m-2r) 2 > W{K)-t/A 



/2m - 2r 



> (l-r/2) 



2m- 2r 
2 



On the other hand using Lemma 8(i) from Section 5: A r {N) < d ( - 2m ~ 2r ) . So 
for 2m - 2r > + 1 we have A r {M) < (1 - r/2) ( 2m ^ 2r ) . Thus condition 

(*) is violated only for r very close to m. 

Lemma 22. Let n r be the number of available vertices (vi's with Wi ^ 0) at 
step r then: 



A r {Af) < 



(2m - 2r) 



2 r 



2m- 2r 



«ax + l) 2 



(m— r) < 



1 



2m- 2r 
«ax + l) 2 



2m — 2r 
2 



Proof. We know Z\ r = cfp 0*2°) ' ^ n OT dcr to bound the first sum, 

note that each d\ r ^ in the first sum appears at most n r — 1 — d\ r ^ times (exclude 

(r) 

the vertex i). It also appears at most d{ — d\ times. By Hardy-Polya-Littlewood 
inequality such a sum is less than ^,^ ^-^)-^ , Therefore 



* < E 0* w ) 



( r )\ 2 min(n r ,rfi + 1) — d, 



(r) 



(m — r) 



Now a simple Lagrange multiplier method (using J2i ^ = 2m — 2r and c?i < 



dmax) tells us the above sum is at most n r ^"^f^ 
(m — r) therefore 

A r (J\f) + (m- r) < 



min(n r ,d ma x + 1) - 



2m-2r 




(d max + l) a 
2m-2r 



+ 1)2 

XT -Ly 



if n r < G? ma x + 1. 
Otherwise. 



The last inequality comes from the fact that n r < 2m — 2r < 2d 2 . Therefore it 
is minimized when n r = d max + 1- B 

Now similar to Section 5.7 define Si(A4), i = 1, . . . , jSrpi + 1, be the set of all 
ordering Af for which (*) fails for the first time at r — m — i. We will show: 
E(/ W 2 l Si ) < (l + o(l)) ~tt and use ££i m"™ = o(l). 
Lemma 22 gives us: 



2m - 2r 
2 



<W) > 



A r {M) 



> 



> 



2m - 2r x 
2 

2m - 2r\ 2m - 2r 



1) ; 



d 2 

/i max \ 

1 4m J 
d 2 

/-i max \ 

" l ~ Am' 



2m — 2r\ m — r 
2 / ^ m3 v 



I2m-2r\ 



Note also that for m — r < d max we may bound 
bym-r since there are at least m — r edges left. Therefore 



A r {M) from below 



( 2m 2 2r ) tpr < \ iaa^ < d max if m - T > d n 

( 2m 2 - 2r ) - &(K) ~ \ 2(m - r) < 2rf max Otherwise. 



therefore, 



- 1 /2m-2r)_^ 



n 



2 min ( d max,i) ( |i 



From here we will closely follow the steps taken in Section 5.7. Since i is the first 
place that (*) is violated: 



m—i—l (2m—2r\ 
2 ) 



TT V 2 J-#(M) 
11 ( 2 m -2 r) _ W) 



< exp 



5 ^ max(^(jV r ) - ^(7V r ),0) 



E 

r=0 



(2m - 2r) 2 



so, 



f(Af)i Si = is, n 



m— 1 (2m— 2r\ 
2 ) 



JUL ( 2 m -2 r) _ W) 



< 2 min(<i majt , exp 



5 ^ max^^r) - f (MO, 0) 



r=0 



(2m- 2r) 2 



Now using Holder's inequality: 

E(/(A0 2 lsJ 
<2 2min ( d — ^^ ax E(l Si exp 



10 "V^ 1 max(<f {Mr) - &(Af r ), 0) 

T ^ 

r=0 



(2m- 2r) 2 



< 2 min < d — i ^ 2 ^ ax E(l Si ) 1 - r/2 E l St exp 



20 m <A 1 max(^(7V r ) - &(K), 0) 
t ^ 

r=0 



(2m - 2rf 



Again by Corollary 2 the second term in the above product is 1 + o(l) and we 
only need to show: 

di^nsi) 1 -^ 2 < (i+o(i))-l-. 

Now using the bound given by equation (41) for P(Sj) we have: 

'22+2min((i max ,i)/i^4-4T 



2 ™(rf m ax,<) d 2i axP ^.)l-r/2 < ( 1 + (!)) 4z 



m 

< (l + o(l))4z(6m- 3r+4r2 ) 4 

< (l + o(l))m- 



l-2-r 



9 Generalized Chernoff Bound: Proof of Lemmas 6, 7 

Proof of Lemma 6. Note that in the G p model the residual degree of a vertex 
u, d u , is sum of d independent Bernoulli random variables with mean q. Two 
generalizations of the Chernoff inequality (Theorems A. 1.11, A. 1.13 in page 267 
of [3]) state that for a > and X\, . . . ,Xd i.i.d. Bcrnoulli(q) random variables: 

a 2 a 3 2 

P(X 1 + ---+X d -qd>a)<e~^ 3+ ^? r , P(Ai + - • -+X d -qd < -a) < 



Applying these two for a = ^I2qd logn Lemma 6 directly follows. ■ 

Proof of Lemma 7. For any vertex u let Nq{u) C V(G) denotes neighbors of 
u in G. Set Ag(u), E>g(u) 7 Cg(u) C E(G) where Ag(u) is the edges that are 
connected to u, P>g(u) is those edges with both endpoints in Ng(u) and Cg(u) 
consists of the edges with exactly one endpoint in Ng(u). For any edge e of G 
let t e — l{ e ^ Gp j. Then we can write 

]T (d u ,-(d-l)q) 

u'~ Gp u 

= £ E (*.-«) 

u'eN Gp (u) eeA a (u')\A G (u) 

= E E (*«-«)- E £(*«-«) 

ii'£iV G (u) eeA G (u')\AG(«) M'eA r G (M)\A r G p («) eeA G (ti')\A G («) 

= J2 (*e"9)+2 E (*«"«)- E K-l-«(d-l)) 

eeCcW e£B G (u) u'£iV G (u)\iV G (u') 

' v ' ^ ' 



{%) (ii) (iii) 

each of (i) and (ii) are sum of 0(d 2 ) i.i.d. Bcrnoulli(q) random variables mi- 
nus their expect ations and similar to Lemma 6 can be shown to be less than 
0(\/l2qd 2 logn) with probability at least 1 — o(l/m 4 ). For (iii) we can say 

£ « ~ 1 - q(d- 1)) < d„ max (K< - 1 - 

u ' e iM«)VW) «'eiv G („)\iv G K) 

now using Lemma 6 twice we can say (iii) is of ( [dq + ^/I2qd logn ] \JYlqd logn ) 
with probability at least 1 — o(l/m 4 ). These finish proof of Lemma 7. ■ 
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